````````````## Where do redditors want to go?
Analysis of posts from [IWantOut](https://www.reddit.com/r/IWantOut/), a reddit forum containing posts of people who would like to move to another country. Almost all posts in the reddit forum contain the origin country of the redditor and his/her dream destination country/countries.
Below is the Sankey Diagram showing the origins and desired destinations of redditors. A wider light puple line indicates a higher number of posts. You can view an expanded version of the Sankey diagram [here](https://rouenlee29.github.io/reddit-iwantout/iwantout_sankey.html).
Most redditors come from North America (not surprising, reddit is most popular in the US). Europe is the most desired destination (should I be surpised?).
## Who are the redditors
## Analysis of posts
I looked at the posts of redditors by destination region
### Words with high Tf-idf scores, by destination region 1
- One _document_ is a collection of ALL posts with the same destination region. For example, all posts with destination as Asia will be collated into one document.
- Every word in a document will be assigned a _Tf-idf score_.
- A word in a document has a high Tf-idf score if it appears almost exclusively in the document and appears very frequently in the document. [Read more](http://www.tfidf.com/).
- The bigger the word in the diagrams below, the higher its Tf-idf score.
Europe | Africa
:-------------------------:|:-------------------------:
 | 
Oceania | Asia
:-------------------------:|:-------------------------:
 | 
Europeaan Union2 | North America
:-------------------------:|:-------------------------:
 | 
Observations
- The word "work" appears very frequently across all posts.
- Seems like "visa" is a concern to those wishing to go to Western countries.
- Those looking to move to the Euopean Union seem aware of the potential need to learn a new language
## Analysis of jobs
### Words with high counts, by destination region
Some redditors reveal their jobs in the post titles. I have extracted this information and performed a word count by destination region. Seems like North America and Europe are attracting people with different skillsets:
Europe | North America
:-------------------------:|:-------------------------:
 | 
## Notes
1. What do you mean by _region_?
A region is more than a country, but less than or equal to a continent.
2. What is the difference between _Europe_ and _European Union_?
- If a redditor writes a specific destination country (for example, United Kingdom, Netherlands or Germany) in Europe, the destination region will be _Europe_.
- If a redditor writes "EU" as his/her destination, then the destination region will be _European Union_.
Code to scrape, transform and create the charts can be found in [my github repo](https://github.com/rouenlee29/reddit-iwantout).